MA in Computational Arts blog › the Unknown Person

the Unknown Person

The Unknown Person is a screen-based installation that connects a piece of the artist’s family history to Britain’s post-colonial reality. Using machine learning processes and facial recognition algorithms, the piece interrogates the gaze of surveillance and social control systems through liminal spaces in the city.

produced by: Eddie Wong

Introduction

In 1949, my grandfather left his family and disappeared into the Malayan jungle to fight the British as a communist guerrilla soldier. After three years of violent rebellion against the armed forces of the British colonial government, he was shot and killed. My grandmother and her children witnessed from afar and in silence how his body was dragged out to be displayed in public. Under the threat of arrest and execution, my grandmother had to deny relations with the deceased when forced to recognize the corpse. Subsequently, my grandmother escaped with five children in tow and became fugitives of the law. These three incidents in my family history formed conceptual structure of the piece.

The Unknown Person interrogates the tension inherent in living in a highly scrutinized world to craft a personal narrative that connects Britain’s violent colonial past to a piece of my family history. What are the parallels between the surveillance and recognition systems of today to the rebellion that the artist's grandparents have lived through and perished? The work reimagines what the machine's gaze at an 'invented person' ingested into a data form might look like. By exploiting neural network processes similar to those in surveillance detection systems, I situate myself at liminal spaces within the City of London – a symbolic zone of post-colonial reality and subsequently redacted myself from its surface.

Concept

The starting point of the work was when I wrote the final term essay The City as a Fictioning Machine for Research and Theory module. I researched on the City of London as a ‘fiction-generating machine’ and hypothesize that the City could be considered as a complex piece of algorithm. I situated myself and my family history from a postcolonial perspective to explore how family narratives are woven.

How can I play these tensions off another – my family's narrative and the city's'? And how can I use computational techniques to bring to life this going back and forth? I decided on the story of my grandfather's communist guerilla activities and my grandmother’s survival in the aftermath. I wanted to establish a link between the surveillance and recognition systems of today to that of my grandparent's time. My grandmother could be seen as a single data point who denied recognizing the corpse of her husband. She defied and refused the optics of state-machines and was able to survive. Would it be possible for her to survive in today’s ultra-scrutinized world? To refuse the all-seeing eye of an Artificial Intelligent dragnet, that can collect, track and control infinite data points on a single individual and all his/her relations?

The driving motif of this piece is optics. I wanted to play off the tension of who’s watching whom and who is being watched through what means in a culture of hyper-surveillance. Optics are present throughout my family narrative too. The colonial government sees my grandfather as a terrorist, the anecdotes of his death and grandmother’s survival are from eyewitness anecdotes and of course, the visible/invisible appearance of my grandfather. This motif is expressed visually in the final output of the film.

The City of London in the piece was a character in itself that observes through its security cameras as well as an implied passive observer of the British Empire's history. From a post-colonial and cultural studies perspective, the cultural theorist Homi Bhabha refers to the liminal having the potential to be a disturbing influence. “This interstitial passage between fixed identifications opens up the possibility of cultural hybridity that entertains difference without an assumed or imposed hierarchy” (Bhabha 5). Therefore, I have chosen the liminal spaces within the City to situate myself as a disruptive wedge at the in-betweenness of the City’s entanglement with Britain's post-colonial reality. The scaffolding in the final installation was the physical representation of the City as a process that generates this reality. In a literal sense, the scaffolding is a latticework that holds my family stories.

I filmed myself having a phone conversation with family members back home recounting the incidents of my grandparents. Using a combination of neural network and object detection processes that are similar to the ones used in the surveillance mechanism, I’ve redacted myself and then fill myself back into the surface foregrounds the tension found in liminal spaces at the threshold of the surveillance apparatus of the city. The flickering pulses from infrared security systems are made explicit, while I become a subject of surveillance, whose specific objectivity is subsumed and ingested into the data form - rendering the self as a smear across the City’s surface, a ghostly embodiment of my grandparent’s stance of refusal.

Background research

Machine Learning

I’ve been interested in what Machine Learning (ML) could generate since I’ve experimented with Generative Adversarial Network (GAN) by playing with the pix2pix repository for my Research Project last year. I knew I wanted to continue to experiment with Machine Learning and do something with the aesthetics of this project. The research informs the visual design of the piece as I wanted to play with the idea of reflection, extraction (reflecting on the extractive capitalism of the City). From working with GAN, I was fascinated with its visual potential that is suggestive of the erasure of space/time boundaries. This duality fits in with the narrative material but I wanted something other than the generative and hallucinatory feeling that GAN outputs.

For the initial visual design of the film, I wanted to keep in line with the "look" of machine learning surveillance systems such as facial recognition and object detection to inform the design of the images. I played with object detection models such as YOLO and face detection models. The Machine Learning technology that I’ve implemented isn’t particularly novel per se. I first came across an experiment by Chris Harris where he uses A.I to detect and then remove vehicles from the street. Another artist Mikhail Rybakov had done tests using A.I to delete bodies. He combined two open-sourced GitHub repositories MaskRCNN for body recognition and Generative Image Inpainting with Contextual Attention to fill back in the gap. When I contacted the artist personally to ask him more about these tests, Mr. Rybakov was graceful enough to offer me some tips on workflow.

Video

Lisa Rave’s Europium was a great inspiration to how I can structure the stories of contemporary concerns of a larger colonial legacy and its lasting impact on the environment. She masterfully weaved through family narratives, extraction of natural resources and interlinks these narratives giving each their own space and context, but she also often returns to the spiral of a nautilus mollusk as a driving motif.

I've had reservations about working with video as a medium. Being on a computational course, I felt the need to make things...computational. However, video art has its place in the art world and there some great machine learning videos out there. In this case, the narrative material of my family story drove the medium. The final piece took the shape of a series of fragmented video essays, which reflected on the fractured nature of how the stories were recounted and remembered. I felt this to be the best, if not the only way to tell the complicated story of my family. The cinematography was crucial to express a sense of liminality. The location choices, each has its historical significance and provided the post-colonial subtext to the narrative.

Technical

Machine Learning

For the machine learning model, I first experimented with installing the MaskRCNN and InFill Painting GitHub repositories. The tutorial series and repo by Mark Jay was an immense help in getting started (Mask RCNN with Keras and Tensorflow) with some early test shots on still images and videos. Although I successfully got the model up and running on the local GPU, it struggled to run smoothly on my machine. As well as that, the model outputs one frame per- second video which was not as impressive to look at.

After extensive experimentation and research, I decided on alternative models to create a similar effect. I relied heavily on the remote GPU and model libraries of RunwayML. The platform provided more models and flexibility for the workflow of producing image sequence files. I decided to use the Deep Lab model by Gene Kogan to extract semantic maps from objects in images. The model was able to capture multiple objects at various scales based on COCO-Stuff 10k/164k and PASCAL VOC 2012 datasets. In my case. I only needed inference from a single image– that of a Person. The model saves a black and white mask out of the image which I then used as a source for segmentation input for the image inpainting task. For that, I've used the Deep Fill model by feeding in the black and white images generated from Deep Lab as segmentation input to fill the missing regions of the image (footage of myself). The model comes pre-trained on the Places2 dataset by MIT which contains a lot of images of outdoor places. I hoped, therefore, it would be able to bias the filling in of the 'Person's' image towards more natural scenes of its surroundings.

One of the challenges with the image inpainting algorithm was the low-resolution output. I needed the output to be relatively high res as it will be displayed on video monitors. The dataset came pre-trained with images of resolution 256 x 256 and the largest hole size 128x128. Above that, the image resolution deteriorates. I was a concern with the pixelation of a 360p video when broadcasted on large monitors. I had to do an upscaling pass from 360p back to 720p and then adding it back to the original image information and mask. That way I kept the original quality everywhere except for the infilled area, and the infilled area is smooth and a bit blurry.

The end result was satisfactory. The paints me out with imperfectly but that adds to the visual potency, otherwise, perfect invisibility meant there would be nothing to see! It posed the question – am I a figure hiding from the gaze or has the camera penetrated even the liminal spaces? The ambiguity brought to life the back and forth of the tension between the self and the story while revealing the liminality of the space.

Interaction – Face Detection

For the interactive component of the piece, I revisited the final term assignment for the Workshop and Creative Coding. I built a face-detection algorithm in Openframeworks and modified the OfxFaceTracker2 addon by stripping the wireframe face-track mask down to its bare minimum and ran it on webcam feed. The addon is based on dlib’s face detection API, that generates the various facial landmark (eyes, nose, mouth etc). My approach was to design the ‘visual look’ that is suggestive of a facial dataset used face detection algorithm used in machine learning training.

One of my concerns with the implementation of an interactive element in my overall piece was that it may seem to the audience (and examiners) to be superfluous. There are two reasons why I’ve added interaction to the piece. On a conceptual level, the webcam feed that is detecting faces on the piece implies that the City’s surveillance apparatus is observing. The viewer then becomes complicit in the system of surveillance and control. On a practical level, it was designed with the final scaffolding structure in mind, so that it draws the viewer into walking around the structure, hence bringing forth the multi-dimensional qualities of the structure. The label on the bounding box around the face-detection mask says "Unknown person". It implies a failure of the machine to label an object. At the same time, the Unknown Person could be just anyone. One of the main challenges I’ve faced was running Openframeworks in Linux mode on a Rasberry Pi. The face-tracker addon was apparently incompatible with the Pi’s operating system. For the exhibition, I had to run the face-detection component from my laptop, which, fortunately was without any issues.

Installation

The installation ran on 6 x monitors, 4 x Raspberry Pis configured to play video upon booting and controlled via SSH, a laptop running the face-tracking code on OpenFameworks, a wide-lens webcam for camera input. I relied on this video looper code from Adafruit tutorial to play movies off of USB drives when inserted in the Raspberry Pi. It calls on the default video player for Pis, omxplayer and can play most videos encoded with the H.264 video codec and in a video format .mp4 format. This is the best solution for my situation because omxplayer will use the Pi's GPU (graphics processing unit) to efficiently play videos that are 720p and even 1080p.

The piece's scaffolding structure measures 8 ft by 7 ft high, an industrial scaffolding. I attached six monitors with cable ties. The monitors (courtesy of tech office) were reappropriated from recycles. The stripped-down, bare metal look adds to the aesthetic value to the piece. My main challenge for the installation stage was mainly working with the scale of the scaffolding. Due to the thickness of the bars, I had to custom fabricate brackets for each of the screens to attached it to the bar. This was not possible due to time constraints as well as the lack of screw holes at the back of these monitors because they had the frames removed. I found an efficient way was to secure the screens by threading the cable ties through the monitors and over the bars. Somehow, that held the screens up although it may not the most elegant solution.

Future development

The future development for this work is currently in progress. I am speaking with performers and a VJ on collaborating to use the machine learning techniques on this piece as a starting point for their own work. The natural next step would be experimenting with deep learning models trained with a smaller, tightly curated dataset. For example, I’d like to try redacting more specific objects such as building types and filling them back in with a trained model from the dataset. I’d also like to develop explore the redacting images in real-time via a camera feed, perhaps in combination with the face tracker algorithm on OpenFrameworks.

Another aspect that I’d like to play within RunwayML is to chain three models together – perhaps the third one being a GAN that fills in the gap. It’d be interesting to see how a neural network works when feedback against itself. I aim to present this work at an ML or related creative technology conferences and show this work, at a smaller scale in at least one exhibition. I’d like to keep learning more about deep learning algorithms and how it’d be used as a storytelling medium in itself. I think it’s important to offer an alternative approach to using ML amidst the increasing democratization of ML tools. On a professional level, the City of London is a fertile ground for investigation in the context of post-colonial liminality. I will pursue this research independently or academically moving forward.

Self-evaluation

Overall, I am pleased with the aesthetic of the neural network redaction effect for the videos. When juxtaposed against the imagery of the City of London, along with the text of my family narrative and displayed on a large scaffolding structure, the piece evoked an otherworldly sense of awe and drama which was as surprising as it was satisfying. A student from the filmmaking course commented that it was ‘strangely immersive and cinematic’.

I felt that the piece delivered on a conceptual level. The structure of the scaffolding succeeded in representing the imposing and impersonal nature of the City. The placement of the screens within and around the scaffolding structure gave an impression of a security control room. This goes back to my initial aim of playing off several large, complex themes with one another (self, machine, place). In the end, the story of my family 'came through'.

The outcome of the installation piece was generally well-received but it was not all according to meticulous planning. On a practical level, the scale of the scaffolding was a massive undertaking (literally speaking!). There were times when I felt that I was entering stage-setting work on a film set, which I was not equipped to handle. The number of monitors and Raspberry Pis proved to be difficult to set up and managed. I could also have used a variation of screen sizes to give the overall piece a better visual quality. Due to limited resources (all the screens are recycled), this was not possible. One of the screens died but the presence of a failed piece of electronic resonated with the sombreness of the story. The face tracking component might have seemed slightly out of place. This was remedied by placing the interaction screen inside the scaffolding so the viewer had to peer inside the structure to interact with it. The code could've been developed further. For instance, having each face leave a trail on-screen and eventually accumulating to fill it up would be quite interesting. Additionally, I could've included a code that stores an image each time it detects face so I can keep track of who has interacted with it.

The making of this piece has stretched my technical limits. When I came on the course with no coding experience to speak of, I wanted to explore machine learning, learn how to make art with code and was (am still) interested in storytelling. From that point of view, I have met all my educational goals in this course.

References

Code

RunwayML

Deep Fill – http://jiahuiyu.com/deepfill/

Deep Lab (Gene Kogan) – https://github.com/genekogan/deeplab-pytorch

Generative InPainting (Jiahui Yu) – https://github.com/JiahuiYu/generative_inpainting

Mask RCNN

Mask RCNN Repo: https://github.com/matterport/

Mask RCNN paper: https://arxiv.org/pdf/1703.06870.pdf

Mask RCNN with Keras and Tensorflow (pt.1) Setup and Installation

Mask RCNN with Keras and Tensorflow (pt.3) process video

OpenFrameworks Face Tracker

http://dlib.net/cnn_face_detector.py.html

https://github.com/HalfdanJ/ofxFaceTracker2

Raspberry Pi Video Looper

https://learn.adafruit.com/raspberry-pi-video-looper/usage#tips-for-looping-videos

Research

Bhabha, Homi K. The Location of Culture. London: Routledge, 1994.

Lisa Rave, http://wholewallfilms.com/europium/

More selected projects

The Critical Value of Digital Remix Practices in the Arts for Producing Culture and Innovation
Computational Art - Research and Theory Lab

A critical reflection on the online exhibition system and a direction toward building advanced cyber exhibitions (A focus on survey and pilot theory)
Computational Art - Research and Theory Lab term 2 projects

Face-off: The Challenges of Facial Recognition Technology in an era of Cosmetic Surgical Modification
comp-art-research-2020-term2

Understanding Black British Information and Communication Technology: The Metamorphosis of Storytelling Into The Digital Sphere
comp-art-research-2019-term2

A study of interactive video and self awareness with the interrelation with technology and cybernetics
comp-art-research-2018-term2

Analysis of the relationship between and development of artificial intelligence art and human art
comp-art-research-2018-term2

Inter-Personal Relations in Interactive Virtual Reality Environment: Communication Network and Role of Technology
comp-art-research-2018-term2

I SMELL THE SMELL OF YOUR BODY, I SMELL THE SMELL OF YOUR BODY IN SOCIETY, I SMELL THE SMELL OF YOUR BODY IN CONNECTION TO ME
comp-art-research-2018-term1

More selected projects

Transitory Sketchfinal-projects-2021

Learning a blind eyefinal-projects-2021

OneNightfinal-projects-2021

Input Machine: Type-Writerfinal-projects-2021

Cobalt Blues – Bokanifinal-projects-2021

Running A Little Latefinal-projects-2021

Maneofinal-projects-2021

The Shape of Waterfinal-projects-2021

The Poetry Of Changefinal-projects-2021

M A R B L E R U Nfinal-projects-2021

void avoid(){}final-projects-2021

A Beautiful Temptationfinal-projects-2021.

Cloud ARfinal-projects-2021

They were expected to see what stuff she was made offinal-projects-2021

Memory Miniaturefinal-projects-2021

Mono no awarefinal-projects-2021

final-projects-2021

Networked Veilfinal-projects-2021

Arthritis

Memories Of Cities That Don’t Exist. “the life we lived without a City.”final-projects-2021

Inside is minefinal-projects-2021

Mishmash us final-projects-2021

Impression final-projects-2021

Message in a Bottlefinal-projects-2021

The Picture and the Portraitfinal-projects-2021

pollinating every plant on earth with satellitesfinal-projects-2021

Pavement Radiofinal-projects-2021

Rumors from Cyberworldfinal-projects-2021

Irregular Flutterfinal-projects-2021

Digital Labyrinthfinal-projects-2021

Trametesfinal-projects-2021

Aleph Null / with-lasers / nathan.adamsfinal-projects-2021

Talking to my Brotherfinal-projects-2021

SHA-BEfinal-projects-2021

Mirko Febbo 3D Brain Mapping Heatmapfinal-projects-2021

Between Firefinal-projects-2021

And Then They Were Gonefinal-projects-2021

/MarksMade/StainedLand/*.pngfinal-projects-2021

Wander [001] – The future nomadfinal-projects-2021

Sonic Tree (Fungal Record)final-projects-2021

Listen Back to AIfinal-projects-2021

Fish, Chips, Cup O’Tea final-projects-2021

NV WA final-projects-2021

If you could never forget a dream, would you still remember me ?final-projects-2021

Timothy Lee – Jitteryfinal-projects-2021

Mama Jai’s Farmfinal-projects-2021

Codexfinal-projects-2021

Spatialfinal-projects-2021

Reconciliationfinal-projects-2021

Tectonic Lingeringfinal-projects-2021

Je me souviensfinal-projects-2021

AEco.5ofinal-projects-2021

The Unbearable Lightness Of Mixed Signalsfinal-projects-2021

In The Windfinal-projects-2021

HeadCasefinal-projects-2021

Chris Newth – Final Project – “Resuscitating Annie”final-projects-2021

The Artifice of the Image : Language and Image in a Networked Environmentcomp-art-research-2020-term2

Generative Music Citieswcc-term-b-2020

Time and Memories Projection Mappingwcc-term-a-2020

State of Returnwcc-term-a-2020

Broadway Boogie Woogiewcc-term-a-2020

Deep (Blue) Fakewcc-term-b-2020

The Entangled Nature of Human and Artificial Intelligencecomp-art-research-2020-term2

The Critical Value of Digital Remix Practices in the Arts for Producing Culture and InnovationComputational Art - Research and Theory Lab

Sketches of Social Influencewcc-term-a-2020

The Final Cutwcc-term-b-2020

Transmaterial Worlding: Using Digital Computation to Unveil Matter’s Animacy.comp-art-research-2020-term2

The Fragmented Mirrorbcc-term b-2021

‘We were never meant to survive’wcc-term-b-2020

blitzAR – The city through the eyes of Augmented Realitycomp-art-research-2020-term2

Experimental music performancecomp-art-research-2020-term2

Mirko Febbo Consciousness Disordercomp-art-research-2020-term2

Not My Typecomp-art-research-2020-term2

Divination ProjectWorkshops in Creative Coding Term 2 (2020)

Robotic Shadow Puppetrywcc-term-b-2020

Cumulative Disadvantagecomp-art-research-2020-term2

A Study Into Media Archaeology & The History Of Computational Artcomp-art-research-2020-term2

Mirko Febbo – EEG + XY plotter + Visualizationwcc-term-a-2020

This Creature Does Not Existcomp-art-research-2020-term2

Transitory Sketch
final-projects-2021

Learning a blind eye
final-projects-2021

OneNight
final-projects-2021

Input Machine: Type-Writer
final-projects-2021

Cobalt Blues – Bokani
final-projects-2021

Running A Little Late
final-projects-2021

Maneo
final-projects-2021

The Shape of Water
final-projects-2021

The Poetry Of Change
final-projects-2021

M A R B L E R U N
final-projects-2021

void avoid(){}
final-projects-2021

A Beautiful Temptation
final-projects-2021.

Cloud AR
final-projects-2021

They were expected to see what stuff she was made of
final-projects-2021

Memory Miniature
final-projects-2021

Mono no aware
final-projects-2021

Networked Veil
final-projects-2021

Memories Of Cities That Don’t Exist. “the life we lived without a City.”
final-projects-2021

Inside is mine
final-projects-2021

Mishmash us
final-projects-2021

Impression
final-projects-2021

Message in a Bottle
final-projects-2021

The Picture and the Portrait
final-projects-2021

pollinating every plant on earth with satellites
final-projects-2021

Pavement Radio
final-projects-2021

Rumors from Cyberworld
final-projects-2021

Irregular Flutter
final-projects-2021

Digital Labyrinth
final-projects-2021

Trametes
final-projects-2021

Aleph Null / with-lasers / nathan.adams
final-projects-2021

Talking to my Brother
final-projects-2021

SHA-BE
final-projects-2021

Mirko Febbo 3D Brain Mapping Heatmap
final-projects-2021

Between Fire
final-projects-2021

And Then They Were Gone
final-projects-2021

/MarksMade/StainedLand/*.png
final-projects-2021

Wander [001] – The future nomad
final-projects-2021

Sonic Tree (Fungal Record)
final-projects-2021

Listen Back to AI
final-projects-2021

Fish, Chips, Cup O’Tea
final-projects-2021

NV WA
final-projects-2021

If you could never forget a dream, would you still remember me ?
final-projects-2021

Timothy Lee – Jittery
final-projects-2021

Mama Jai’s Farm
final-projects-2021

Codex
final-projects-2021

Spatial
final-projects-2021

Reconciliation
final-projects-2021

Tectonic Lingering
final-projects-2021

Je me souviens
final-projects-2021

AEco.5o
final-projects-2021

The Unbearable Lightness Of Mixed Signals
final-projects-2021

In The Wind
final-projects-2021

HeadCase
final-projects-2021

Chris Newth – Final Project – “Resuscitating Annie”
final-projects-2021

The Artifice of the Image : Language and Image in a Networked Environment
comp-art-research-2020-term2

Generative Music Cities
wcc-term-b-2020

Time and Memories Projection Mapping
wcc-term-a-2020

State of Return
wcc-term-a-2020

Broadway Boogie Woogie
wcc-term-a-2020

Deep (Blue) Fake
wcc-term-b-2020

The Entangled Nature of Human and Artificial Intelligence
comp-art-research-2020-term2

The Critical Value of Digital Remix Practices in the Arts for Producing Culture and Innovation
Computational Art - Research and Theory Lab

Sketches of Social Influence
wcc-term-a-2020

The Final Cut
wcc-term-b-2020

Transmaterial Worlding: Using Digital Computation to Unveil Matter’s Animacy.
comp-art-research-2020-term2

The Fragmented Mirror
bcc-term b-2021

‘We were never meant to survive’
wcc-term-b-2020

blitzAR – The city through the eyes of Augmented Reality
comp-art-research-2020-term2

Experimental music performance
comp-art-research-2020-term2

Mirko Febbo Consciousness Disorder
comp-art-research-2020-term2

Not My Type
comp-art-research-2020-term2

Divination Project
Workshops in Creative Coding Term 2 (2020)

Robotic Shadow Puppetry
wcc-term-b-2020

Cumulative Disadvantage
comp-art-research-2020-term2

A Study Into Media Archaeology & The History Of Computational Art
comp-art-research-2020-term2

Mirko Febbo – EEG + XY plotter + Visualization
wcc-term-a-2020

This Creature Does Not Exist
comp-art-research-2020-term2

Dialogue & Embodied Container
comp-art-research-2020-term2

Jeff in a Jar 3.2
comp-art-research-2020-term2

99 Truths / Conspiracy Bot
comp-art-research-2020-term2